A Corpus-based Study of Latinate Words in Contemporary English
نویسندگان
چکیده
The English language has borrowed extensively from Latin. Despite their widely acknowledged importance, however, Latinate words are under-studied especially quantitatively according to their use across different linguistic settings. In this paper, we report a corpus-based survey of Latinate words and their use in contemporary English. The objective is to chart the use and distribution of Latinate words across a set of different text categories and subject domains in order to identify patterns of variation across such settings. The British National Corpus is chosen for its large size and wide variety of texts. Our results will show that the density of Latinate words, calculated as the proportion of Latinate words amongst all running tokens, successfully separates speech from writing and, within writing, academic prose from non-academic prose. The correlation between Latinate density and degrees of text formality is then compared with human ranking and also analyzed by the linear regression analysis. The results suggest a strong correlation between Latinate density and degrees of text formality. Our results also show that even different domains have their own preferences for the use of Latinate words. Our investigation is significant in two respects. Firstly, it is probably the first corpus-based, large scale survey of the use of Latinate words in contemporary English. Secondly, it measures the density of Latinate words both across different text categories as an important stylistic feature and across a set of different domains as a subject-specific differentia. Such an investigation lends itself to our understanding of the impact of Latin on contemporary English and the empirical findings will also contribute to practical applications such as automatic text classification and genre detection.
منابع مشابه
Hedges in English for Academic Purposes: A Corpus-based study of Iranian EFL learners
Hedges, as tools to express tentativeness and doubt, have been studied in plenty of research papers in the Iranian EFL research setting. However, their use in a learner corpus, portraying Iranian learner English, is in need of more research attention. With this end in view, this study aimed at investigating how Iranian EFL learners who have majored in English-related fields in Iran deployed hed...
متن کاملA Corpus-based Analysis of Epistemic Stance Adverbs in Essays Written by Native English Speakers and Iranian EFL Learners
Academic essays entail taking a stance on the truth value of propositions. Epistemic adverbs deal with the speaker's assessment of the truth value of propositions. Employing a corpus-based approach with descriptive statistics and qualitative description, this study explored the use of epistemic stance adverbs in academic essays written by native English speakers and Iranian EFL learners. Follow...
متن کاملA Corpus-driven Food Science and Technology Academic Word List
The overarching goal of this study was to create a list of the most frequently occurring academic words in Food Science and Technology (FST). To this end, a 4,652,444-word corpus called Food Science and Technology Research Articles (FSTRA), which included 1,421 research articles (RAs) randomly selected from 38 journals across five sub-disciplines in FST, was developed. Frequency and range-based...
متن کاملDo We Need Discipline-Specific Academic Word Lists? Linguistics Academic Word List (LAWL)
This corpus-based study aimed at exploring the most frequently-used academic words in linguistics and compare the wordlist with the distribution of high frequency words in Coxhead’s Academic Word List (AWL) and West’s General Service List (GSL) to examine their coverage within the linguistics corpus. To this end, a corpus of 700 linguistics research articles (LRAC), consisting of approximately ...
متن کاملMaterial Development and English for Academic Purposes Word Lists; a Reductionist Approach
Nagy (1988) states that vocabulary is a prerequisite factor in comprehension. Drawing upon a reductionist approach and having in mind the prospects for material development, this study aimed at creating an English for Academic Purposes Word List (EAPWL). The corpus of this study was compiled from a corpus containing 6479 pages of texts, 2,081,678 million tokens (running words) and 63825 types (...
متن کامل